55 research outputs found

    Cubic exact solutions for the estimation of pairwise haplotype frequencies: implications for linkage disequilibrium analyses and a web tool 'CubeX'

    Background: The frequency of a haplotype comprising one allele at each of two loci can be expressed as a cubic equation (the 'Hill equation'), the solution of which gives that frequency. Most haplotype and linkage disequilibrium analysis programs use iteration-based algorithms which substitute an estimate of haplotype frequency into the equation, producing a new estimate which is repeatedly fed back into the equation until the values converge to a maximum likelihood estimate (expectation-maximisation). Results: We present a program, "CubeX", which calculates the biologically possible exact solution(s) and provides estimated haplotype frequencies, D', r² and χ² values for each. CubeX provides a "complete" analysis of haplotype frequencies and linkage disequilibrium for a pair of biallelic markers under situations where sampling variation and genotyping errors distort sample Hardy-Weinberg equilibrium, potentially causing more than one biologically possible solution. We also present an analysis of simulations and real data using the algebraically exact solution, which indicates that under perfect sample Hardy-Weinberg equilibrium there is only one biologically possible solution, but that under other conditions there may be more. Conclusion: Our analyses demonstrate that lower allele frequencies, lower sample numbers, population stratification and a possible |D'| value of 1 are particularly susceptible to distortion of sample Hardy-Weinberg equilibrium, which has significant implications for calculation of linkage disequilibrium in small sample sizes (eg HapMap) and rarer alleles (eg paucimorphisms, q < 0.05) that may have particular disease relevance and require improved approaches for meaningful evaluation.
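    The iteration-based approach that the abstract contrasts with CubeX can be illustrated with a minimal sketch. This is not CubeX's method (CubeX solves the cubic directly) and not any particular program's implementation. The function names, the 3x3 `counts` layout (rows indexed by copies of allele A, columns by copies of allele B) and the even-split starting point are all assumptions made for illustration. Only double heterozygotes are phase-ambiguous, so each pass re-splits them between the AB/ab and Ab/aB phases in proportion to the current frequency estimates.

```python
def em_haplotype_freqs(counts, tol=1e-12, max_iter=10_000):
    """Estimate two-locus haplotype frequencies by expectation-maximisation.

    counts[i][j] = number of individuals carrying i copies of allele A
    and j copies of allele B (hypothetical layout chosen for this sketch).
    """
    n = sum(sum(row) for row in counts)
    if n == 0:
        raise ValueError("no genotypes supplied")
    total = 2 * n  # number of haplotypes in the sample

    # Haplotype counts that are unambiguous from the genotypes.
    known = {
        "AB": 2 * counts[2][2] + counts[2][1] + counts[1][2],
        "Ab": 2 * counts[2][0] + counts[2][1] + counts[1][0],
        "aB": 2 * counts[0][2] + counts[0][1] + counts[1][2],
        "ab": 2 * counts[0][0] + counts[0][1] + counts[1][0],
    }
    dh = counts[1][1]  # double heterozygotes: phase unknown

    # Assumed starting point: split double heterozygotes evenly.
    f = {h: (known[h] + dh / 2) / total for h in known}

    for _ in range(max_iter):
        # E-step: expected share of double heterozygotes in phase AB/ab.
        denom = f["AB"] * f["ab"] + f["Ab"] * f["aB"]
        w = f["AB"] * f["ab"] / denom if denom > 0 else 0.5
        # M-step: recompute frequencies from the expected haplotype counts.
        new = {
            "AB": (known["AB"] + dh * w) / total,
            "Ab": (known["Ab"] + dh * (1 - w)) / total,
            "aB": (known["aB"] + dh * (1 - w)) / total,
            "ab": (known["ab"] + dh * w) / total,
        }
        if max(abs(new[h] - f[h]) for h in f) < tol:
            return new
        f = new
    return f


def ld_stats(f):
    """Return (D, D', r^2) from the four haplotype frequencies."""
    p_a = f["AB"] + f["Ab"]  # frequency of allele A
    p_b = f["AB"] + f["aB"]  # frequency of allele B
    d = f["AB"] - p_a * p_b
    if d >= 0:
        d_max = min(p_a * (1 - p_b), (1 - p_a) * p_b)
    else:
        d_max = min(p_a * p_b, (1 - p_a) * (1 - p_b))
    d_prime = d / d_max if d_max > 0 else 0.0
    var = p_a * (1 - p_a) * p_b * (1 - p_b)
    r2 = d * d / var if var > 0 else 0.0
    return d, d_prime, r2
```

    An iteration like this returns a single estimate that depends on where it starts. When more than one biologically possible solution exists, as the abstract describes, it gives no indication of the others.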

    Genome-Wide Data-Mining of Candidate Human Splice Translational Efficiency Polymorphisms (STEPs) and an Online Database

    Variation in pre-mRNA splicing is common and in some cases caused by genetic variants in intronic splicing motifs. Recent studies into the insulin gene (INS) discovered a polymorphism in a 5' non-coding intron that influences the likelihood of intron retention in the final mRNA, extending the 5' untranslated region and maintaining protein quality. Retention was also associated with increased insulin levels, suggesting that such variants, termed splice translational efficiency polymorphisms (STEPs), may relate to disease phenotypes through differential protein expression. We set out to explore the prevalence of STEPs in the human genome and validate this new category of protein quantitative trait loci (pQTL) using publicly available data. Gene transcript and variant data were collected and mined for candidate STEPs in motif regions. Sequences from transcripts containing potential STEPs were analysed for evidence of splice site recognition and an effect in expressed sequence tags (ESTs). Sixteen publicly released genome-wide association data sets of common diseases were searched for association to candidate polymorphisms with HapMap frequency data. Our study found 3324 candidate STEPs lying in motif sequences of 5' non-coding introns, and further mining revealed 170 with transcript evidence of intron retention. Twenty-one potential STEPs had EST evidence of intron retention or exon extension, as well as population frequency data for comparison. Results suggest that the insulin STEP was not a unique example and that many STEPs may occur genome-wide with potentially causal effects in complex disease. An online database of STEPs is freely accessible at http://dbstep.genes.org.uk/

    An IGF-I promoter polymorphism modifies the relationships between birth weight and risk factors for cardiovascular disease and diabetes at age 36

    OBJECTIVE: To investigate whether an IGF-I promoter polymorphism was associated with birth weight and risk factors for cardiovascular disease (CVD) and type 2 diabetes (T2DM), and whether the birth weight – risk factor relationship was the same for each genotype. DESIGN AND PARTICIPANTS: 264 subjects (mean age 36 years) had data available on birth weight, IGF-I promoter polymorphism genotype, and CVD and T2DM risk factors. Student's t-test and regression analyses were applied to analyse differences in birth weight and differences in the birth weight – risk factor relationship between the genotypes. RESULTS: Male variant carriers (VCs) of the IGF-I promoter polymorphism had a 0.2 kg lower birth weight than men with the wild type allele (p = 0.009). Of the risk factors for CVD and T2DM, only LDL concentration was associated with the genotype for the polymorphism. Most birth weight – risk factor relationships were stronger in the VC subjects, among them the birth weight – systolic blood pressure relationship: 1 kg lower birth weight was related to an 8.0 mmHg higher systolic blood pressure. CONCLUSION: The polymorphism in the promoter region of the IGF-I gene is related to birth weight in men only, and to LDL concentration only. Furthermore, the genotype for this polymorphism modified the relationships between birth weight and the risk factors, especially for systolic and diastolic blood pressure.

    Steroid receptor coactivator-1 modulates the function of Pomc neurons and energy homeostasis

    Hypothalamic neurons expressing the anorectic peptide Pro-opiomelanocortin (Pomc) regulate food intake and body weight. Here, we show that Steroid Receptor Coactivator-1 (SRC-1) interacts with a target of leptin receptor activation, phosphorylated STAT3, to potentiate Pomc transcription. Deletion of SRC-1 in Pomc neurons in mice attenuates their depolarization by leptin, decreases Pomc expression and increases food intake leading to high-fat diet-induced obesity. In humans, fifteen rare heterozygous variants in SRC-1 found in severely obese individuals impair leptin-mediated Pomc reporter activity in cells, whilst four variants found in non-obese controls do not. In a knock-in mouse model of a loss of function human variant (SRC-1L1376P), leptin-induced depolarization of Pomc neurons and Pomc expression are significantly reduced, and food intake and body weight are increased. In summary, we demonstrate that SRC-1 modulates the function of hypothalamic Pomc neurons, and suggest that targeting SRC-1 may represent a useful therapeutic strategy for weight loss.

    Low-frequency variation in TP53 has large effects on head circumference and intracranial volume.

    Cranial growth and development is a complex process which affects the closely related traits of head circumference (HC) and intracranial volume (ICV). The underlying genetic influences shaping these traits during the transition from childhood to adulthood are little understood, but might include both age-specific genetic factors and low-frequency genetic variation. Here, we model the developmental genetic architecture of HC, showing this is genetically stable and correlated with genetic determinants of ICV. Investigating up to 46,000 children and adults of European descent, we identify association with final HC and/or final ICV + HC at 9 novel common and low-frequency loci, illustrating that genetic variation from a wide allele frequency spectrum contributes to cranial growth. The largest effects are reported for low-frequency variants within TP53, with 0.5 cm wider heads in increaser-allele carriers versus non-carriers during mid-childhood, suggesting a previously unrecognized role of TP53 transcripts in human cranial development.

    The UK10K project identifies rare variants in health and disease

    M. Kivimäki is a working group member. The contribution of rare and low-frequency variants to human traits is largely unexplored. Here we describe insights from sequencing whole genomes (low read depth, 7x) or exomes (high read depth, 80x) of nearly 10,000 individuals from population-based and disease collections. In extensively phenotyped cohorts we characterize over 24 million novel sequence variants, generate a highly accurate imputation reference panel and identify novel alleles associated with levels of triglycerides (APOB), adiponectin (ADIPOQ) and low-density lipoprotein cholesterol (LDLR and RGAG1) from single-marker and rare variant aggregation tests. We describe population structure and functional annotation of rare and low-frequency variants, use the data to estimate the benefits of sequencing for association studies, and summarize lessons from disease-specific collections. Finally, we make available an extensive resource, including individual-level genetic and phenotypic data and web-based tools to facilitate the exploration of association results.

    Combination of 768-well microplate array diagonal gel electrophoresis with duplex PCR of X and Y chromosome markers for quality control of epidemiological DNA banks

    Large DNA banks for human epidemiological studies have become an increasingly important research tool. The power of genotype-phenotype studies depends on the quality of phenotyping, the quality of genotyping, and the correct linking of phenotypes to genotypes. Samples must be tracked through numerous steps between subject or patient and post-genotypic data. Only one phenotype, sex, has a perfect and binary correlation with genotype. In mixed sex studies, it may be advantageous for purposes of quality control to keep sexes mixed during the steps from acquisition to DNA bank, in order to be able to check later for sample swaps. We have designed a duplex PCR combining an amplicon from MAOA marking the X chromosome and an amplicon from DDX3Y marking the Y chromosome. We combined this with a simple, economical, palmtop-sized, 768-well microplate-compatible electrophoresis system developed in-house for examination of duplex PCR products. We applied this quality control test in the validation of two DNA banks.